Overview

Dataset statistics

Number of variables22
Number of observations17107
Missing cells22043
Missing cells (%)5.9%
Duplicate rows220
Duplicate rows (%)1.3%
Total size in memory2.9 MiB
Average record size in memory177.0 B

Variable types

Numeric15
Categorical4
DateTime2
Boolean1

Alerts

Dataset has 220 (1.3%) duplicate rowsDuplicates
post_text has a high cardinality: 13838 distinct values High cardinality
post_id is highly correlated with reaction_countHigh correlation
likes is highly correlated with comments and 6 other fieldsHigh correlation
comments is highly correlated with likes and 1 other fieldsHigh correlation
shares is highly correlated with likes and 1 other fieldsHigh correlation
reaction_count is highly correlated with post_id and 7 other fieldsHigh correlation
Like is highly correlated with reaction_count and 5 other fieldsHigh correlation
Love is highly correlated with reaction_count and 5 other fieldsHigh correlation
Wow is highly correlated with likes and 6 other fieldsHigh correlation
Sad is highly correlated with likes and 6 other fieldsHigh correlation
Angry is highly correlated with likes and 6 other fieldsHigh correlation
Care is highly correlated with likes and 6 other fieldsHigh correlation
likes is highly correlated with reaction_count and 4 other fieldsHigh correlation
reaction_count is highly correlated with likes and 4 other fieldsHigh correlation
Like is highly correlated with likes and 4 other fieldsHigh correlation
Love is highly correlated with likes and 4 other fieldsHigh correlation
Wow is highly correlated with likes and 3 other fieldsHigh correlation
Sad is highly correlated with AngryHigh correlation
Angry is highly correlated with SadHigh correlation
Care is highly correlated with likes and 3 other fieldsHigh correlation
likes is highly correlated with comments and 1 other fieldsHigh correlation
comments is highly correlated with likesHigh correlation
reaction_count is highly correlated with likes and 6 other fieldsHigh correlation
Like is highly correlated with reaction_count and 5 other fieldsHigh correlation
Love is highly correlated with reaction_count and 5 other fieldsHigh correlation
Wow is highly correlated with reaction_count and 5 other fieldsHigh correlation
Sad is highly correlated with reaction_count and 5 other fieldsHigh correlation
Angry is highly correlated with reaction_count and 5 other fieldsHigh correlation
Care is highly correlated with reaction_count and 5 other fieldsHigh correlation
post_id is highly correlated with usernameHigh correlation
likes is highly correlated with reaction_count and 4 other fieldsHigh correlation
comments is highly correlated with reaction_count and 5 other fieldsHigh correlation
shares is highly correlated with Like and 2 other fieldsHigh correlation
username is highly correlated with post_id and 2 other fieldsHigh correlation
reaction_count is highly correlated with likes and 6 other fieldsHigh correlation
Like is highly correlated with likes and 7 other fieldsHigh correlation
Love is highly correlated with likes and 6 other fieldsHigh correlation
Wow is highly correlated with likes and 8 other fieldsHigh correlation
Sad is highly correlated with comments and 6 other fieldsHigh correlation
Angry is highly correlated with likes and 2 other fieldsHigh correlation
Care is highly correlated with comments and 6 other fieldsHigh correlation
Weekday is highly correlated with DayHigh correlation
Day is highly correlated with username and 1 other fieldsHigh correlation
Hour is highly correlated with usernameHigh correlation
timestamp has 9467 (55.3%) missing values Missing
likes has 2333 (13.6%) missing values Missing
reaction_count has 10243 (59.9%) missing values Missing
likes is highly skewed (γ1 = 37.44574063) Skewed
comments is highly skewed (γ1 = 71.48847167) Skewed
shares is highly skewed (γ1 = 48.03356063) Skewed
Love is highly skewed (γ1 = 25.25020846) Skewed
Angry is highly skewed (γ1 = 20.46354447) Skewed
Care is highly skewed (γ1 = 25.43157297) Skewed
comments has 1797 (10.5%) zeros Zeros
shares has 2508 (14.7%) zeros Zeros
Like has 10243 (59.9%) zeros Zeros
Love has 10979 (64.2%) zeros Zeros
Wow has 12182 (71.2%) zeros Zeros
Sad has 12328 (72.1%) zeros Zeros
Angry has 12232 (71.5%) zeros Zeros
Care has 12435 (72.7%) zeros Zeros
Minute has 875 (5.1%) zeros Zeros
Hour has 181 (1.1%) zeros Zeros

Reproduction

Analysis started2021-12-10 21:32:36.728097
Analysis finished2021-12-10 21:33:26.318077
Duration49.59 seconds
Software versionpandas-profiling v3.1.1
Download configurationconfig.json

Variables

post_id
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION

Distinct15722
Distinct (%)91.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.834233898 × 1015
Minimum1.056919014 × 1014
Maximum1.016044166 × 1016
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size267.3 KiB

Quantile statistics

Minimum1.056919014 × 1014
5-th percentile1.985850383 × 1014
Q14.06963561 × 1014
median8.636305042 × 1014
Q34.243907789 × 1015
95-th percentile1.015933768 × 1016
Maximum1.016044166 × 1016
Range1.005474976 × 1016
Interquartile range (IQR)3.836944228 × 1015

Descriptive statistics

Standard deviation3.622284461 × 1015
Coefficient of variation (CV)1.278047117
Kurtosis0.05365032852
Mean2.834233898 × 1015
Median Absolute Deviation (MAD)5.86595757 × 1014
Skewness1.304723119
Sum-6.854992929 × 1018
Variance1.312094472 × 1031
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1.015921876 × 10165
 
< 0.1%
1.015921876 × 10165
 
< 0.1%
1.015922523 × 10164
 
< 0.1%
1.04504267 × 10153
 
< 0.1%
5.012509713 × 10143
 
< 0.1%
2.483759837 × 10143
 
< 0.1%
1.83446918 × 10153
 
< 0.1%
5.414053838 × 10143
 
< 0.1%
8.794811063 × 10143
 
< 0.1%
9.664029906 × 10143
 
< 0.1%
Other values (15712)17072
99.8%
ValueCountFrequency (%)
1.056919014 × 10141
< 0.1%
1.058777581 × 10141
< 0.1%
1.066903514 × 10141
< 0.1%
1.067466514 × 10141
< 0.1%
1.068339479 × 10141
< 0.1%
1.076193145 × 10141
< 0.1%
1.078697413 × 10141
< 0.1%
1.093922079 × 10141
< 0.1%
1.099065109 × 10141
< 0.1%
1.104335342 × 10141
< 0.1%
ValueCountFrequency (%)
1.016044166 × 10161
< 0.1%
1.016044166 × 10161
< 0.1%
1.016044161 × 10161
< 0.1%
1.016044161 × 10161
< 0.1%
1.01604416 × 10161
< 0.1%
1.016044157 × 10161
< 0.1%
1.016044156 × 10161
< 0.1%
1.016044155 × 10161
< 0.1%
1.016044147 × 10161
< 0.1%
1.016044147 × 10161
< 0.1%

post_text
Categorical

HIGH CARDINALITY

Distinct13838
Distinct (%)80.9%
Missing0
Missing (%)0.0%
Memory size267.3 KiB
 
351
#Diga_Diga
 
80
#BusinessNews
 
54
🔶 آخر الأخبار الرياضية في الكرونوسبور #Chronosport #ShemsFm
 
37
#DigaDiga
 
37
Other values (13833)
16548 

Length

Max length1916
Median length82
Mean length104.7989127
Min length0

Characters and Unicode

Total characters1792795
Distinct characters827
Distinct categories21 ?
Distinct scripts4 ?
Distinct blocks16 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique12163 ?
Unique (%)71.1%

Sample

1st row⚽️ 🎁 1200 دينار كاش للربح مع Tunisiabet.net بعد ايقاف مقابلة #ليون و #مارسيليا ، التفاصيل في الفيديو 👇👇 Tunisiabet.net #Stade_Med by #Radio_Med #tunisiaBet
2nd rowاخماد حريق بنزل بالحمامات الشمالية
3rd rowيتجدد موعدكم مع #مروى_المعموري من الاثنين للجمعة في #Be_Cool من ال-19 🕖 حتى ال-21🕘 👈 تبعونا تلقاو كل ما يهمكم مع برشا موجات ايجابية على #راديو_ماد ----------------- Suivez-nous 🎙 | Live : https://radiomedtunisie.com/LIVE 📸 | Instagram : https://www.instagram.com/radiomedtn/ 📱 | Facebook : https://www.facebook.com/RadioMedTunisie
4th rowتونس تستضيف فعالية “صنع في ليبيا” لدعم شراكة البلدين
5th row#L'affiche : نقترحو عليكم سلسلة تاريخية من أنجح الأعمال الكورية الجنوبية (المملكة)

Common Values

ValueCountFrequency (%)
351
 
2.1%
#Diga_Diga80
 
0.5%
#BusinessNews54
 
0.3%
🔶 آخر الأخبار الرياضية في الكرونوسبور #Chronosport #ShemsFm37
 
0.2%
#DigaDiga37
 
0.2%
أخبار السابعة صباحاً مع بسمة سعايدية Nouvelles de sept heures du matin avec Basma Saidia29
 
0.2%
أخبار منتصف النهار مع بسمة سعايدية Nouvelles de mi-journée avec Basma Saidia25
 
0.1%
معرض الصحافة مع منال جاء بالله21
 
0.1%
برنامج Perla Kids مباشرة من فضاء هدهود20
 
0.1%
اشري تربح مع خالد وفرج وصابر20
 
0.1%
Other values (13828)16433
96.1%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
9494
 
3.3%
في7632
 
2.7%
و5751
 
2.0%
من4282
 
1.5%
مع4207
 
1.5%
على3443
 
1.2%
تونس1445
 
0.5%
؟1109
 
0.4%
اليوم1052
 
0.4%
عن1010
 
0.4%
Other values (41871)246784
86.2%

Most occurring characters

ValueCountFrequency (%)
245618
 
13.7%
ا169165
 
9.4%
ل122681
 
6.8%
ي96359
 
5.4%
م68994
 
3.8%
و61830
 
3.4%
ر57006
 
3.2%
ن55141
 
3.1%
ت46151
 
2.6%
ب45390
 
2.5%
Other values (817)824460
46.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter1107226
61.8%
Space Separator245618
 
13.7%
Lowercase Letter233015
 
13.0%
Other Punctuation53153
 
3.0%
Uppercase Letter42823
 
2.4%
Other Symbol28513
 
1.6%
Control27510
 
1.5%
Dash Punctuation20015
 
1.1%
Decimal Number16637
 
0.9%
Connector Punctuation8954
 
0.5%
Other values (11)9331
 
0.5%

Most frequent character per category

Other Symbol
ValueCountFrequency (%)
🇹3496
 
12.3%
🇳3490
 
12.2%
🔴3148
 
11.0%
🎙1379
 
4.8%
🔶987
 
3.5%
👈984
 
3.5%
955
 
3.3%
🌐945
 
3.3%
616
 
2.2%
🟠513
 
1.8%
Other values (566)12000
42.1%
Other Letter
ValueCountFrequency (%)
ا169165
15.3%
ل122681
 
11.1%
ي96359
 
8.7%
م68994
 
6.2%
و61830
 
5.6%
ر57006
 
5.1%
ن55141
 
5.0%
ت46151
 
4.2%
ب45390
 
4.1%
ة43609
 
3.9%
Other values (71)340900
30.8%
Lowercase Letter
ValueCountFrequency (%)
e29135
12.5%
a21318
 
9.1%
s20006
 
8.6%
o17378
 
7.5%
i16292
 
7.0%
t14712
 
6.3%
n12579
 
5.4%
r11391
 
4.9%
m10832
 
4.6%
u10535
 
4.5%
Other values (31)68837
29.5%
Uppercase Letter
ValueCountFrequency (%)
S5109
 
11.9%
M4997
 
11.7%
L2801
 
6.5%
N2701
 
6.3%
I2522
 
5.9%
T2495
 
5.8%
A2426
 
5.7%
E2355
 
5.5%
R2151
 
5.0%
F2142
 
5.0%
Other values (21)13124
30.6%
Other Punctuation
ValueCountFrequency (%)
#15876
29.9%
.10757
20.2%
:9708
18.3%
/6690
12.6%
'2276
 
4.3%
؟2118
 
4.0%
"2021
 
3.8%
!1241
 
2.3%
،885
 
1.7%
*409
 
0.8%
Other values (16)1172
 
2.2%
Nonspacing Mark
ValueCountFrequency (%)
ّ2158
42.1%
1472
28.7%
ٔ306
 
6.0%
ً293
 
5.7%
́266
 
5.2%
ُ243
 
4.7%
َ139
 
2.7%
ِ72
 
1.4%
ْ67
 
1.3%
ٕ27
 
0.5%
Other values (7)85
 
1.7%
Format
ValueCountFrequency (%)
399
34.7%
168
14.6%
󠁧162
14.1%
87
 
7.6%
󠁿82
 
7.1%
󠁢82
 
7.1%
󠁥80
 
7.0%
󠁮80
 
7.0%
3
 
0.3%
󠁴2
 
0.2%
Other values (4)6
 
0.5%
Decimal Number
ValueCountFrequency (%)
03631
21.8%
12979
17.9%
22798
16.8%
31697
10.2%
91062
 
6.4%
51015
 
6.1%
8914
 
5.5%
6878
 
5.3%
7844
 
5.1%
4817
 
4.9%
Math Symbol
ValueCountFrequency (%)
|991
62.5%
>273
 
17.2%
208
 
13.1%
+56
 
3.5%
=32
 
2.0%
22
 
1.4%
<3
 
0.2%
~1
 
0.1%
Final Punctuation
ValueCountFrequency (%)
230
73.5%
69
 
22.0%
»14
 
4.5%
Initial Punctuation
ValueCountFrequency (%)
45
61.6%
«17
 
23.3%
11
 
15.1%
Modifier Symbol
ValueCountFrequency (%)
🏻7
58.3%
🏼3
25.0%
`2
 
16.7%
Dash Punctuation
ValueCountFrequency (%)
-20006
> 99.9%
9
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
)387
99.7%
1
 
0.3%
Open Punctuation
ValueCountFrequency (%)
(376
99.7%
1
 
0.3%
Currency Symbol
ValueCountFrequency (%)
2
66.7%
$1
33.3%
Space Separator
ValueCountFrequency (%)
245618
100.0%
Control
ValueCountFrequency (%)
27510
100.0%
Connector Punctuation
ValueCountFrequency (%)
_8954
100.0%
Modifier Letter
ValueCountFrequency (%)
ـ275
100.0%
Enclosing Mark
ValueCountFrequency (%)
25
100.0%

Most occurring scripts

ValueCountFrequency (%)
Arabic1107235
61.8%
Common404482
 
22.6%
Latin275838
 
15.4%
Inherited5240
 
0.3%

Most frequent character per script

Common
ValueCountFrequency (%)
245618
60.7%
27510
 
6.8%
-20006
 
4.9%
#15876
 
3.9%
.10757
 
2.7%
:9708
 
2.4%
_8954
 
2.2%
/6690
 
1.7%
03631
 
0.9%
🇹3496
 
0.9%
Other values (641)52236
 
12.9%
Arabic
ValueCountFrequency (%)
ا169165
15.3%
ل122681
 
11.1%
ي96359
 
8.7%
م68994
 
6.2%
و61830
 
5.6%
ر57006
 
5.1%
ن55141
 
5.0%
ت46151
 
4.2%
ب45390
 
4.1%
ة43609
 
3.9%
Other values (75)340909
30.8%
Latin
ValueCountFrequency (%)
e29135
 
10.6%
a21318
 
7.7%
s20006
 
7.3%
o17378
 
6.3%
i16292
 
5.9%
t14712
 
5.3%
n12579
 
4.6%
r11391
 
4.1%
m10832
 
3.9%
u10535
 
3.8%
Other values (62)111660
40.5%
Inherited
ValueCountFrequency (%)
ّ2158
41.2%
1472
28.1%
ٔ306
 
5.8%
ً293
 
5.6%
́266
 
5.1%
ُ243
 
4.6%
َ139
 
2.7%
87
 
1.7%
ِ72
 
1.4%
ْ67
 
1.3%
Other values (9)137
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
Arabic1113826
62.1%
ASCII642390
35.8%
None19980
 
1.1%
Enclosed Alphanum Sup7835
 
0.4%
Punctuation1518
 
0.1%
Dingbats1508
 
0.1%
VS1478
 
0.1%
Misc Symbols1443
 
0.1%
Geometric Shapes Ext920
 
0.1%
Emoticons532
 
< 0.1%
Other values (6)1365
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
245618
38.2%
e29135
 
4.5%
27510
 
4.3%
a21318
 
3.3%
s20006
 
3.1%
-20006
 
3.1%
o17378
 
2.7%
i16292
 
2.5%
#15876
 
2.5%
t14712
 
2.3%
Other values (80)214539
33.4%
Arabic
ValueCountFrequency (%)
ا169165
15.2%
ل122681
 
11.0%
ي96359
 
8.7%
م68994
 
6.2%
و61830
 
5.6%
ر57006
 
5.1%
ن55141
 
5.0%
ت46151
 
4.1%
ب45390
 
4.1%
ة43609
 
3.9%
Other values (52)347500
31.2%
Enclosed Alphanum Sup
ValueCountFrequency (%)
🇹3496
44.6%
🇳3490
44.5%
🇪214
 
2.7%
🇩93
 
1.2%
🇮80
 
1.0%
🇷79
 
1.0%
🇫75
 
1.0%
🇸70
 
0.9%
🇧68
 
0.9%
🇱65
 
0.8%
Other values (19)105
 
1.3%
None
ValueCountFrequency (%)
🔴3148
 
15.8%
é1889
 
9.5%
🎙1379
 
6.9%
ï1319
 
6.6%
🔶987
 
4.9%
👈984
 
4.9%
🌐945
 
4.7%
🔸512
 
2.6%
🌤470
 
2.4%
🔵437
 
2.2%
Other values (494)7910
39.6%
VS
ValueCountFrequency (%)
1472
99.6%
6
 
0.4%
Misc Symbols
ValueCountFrequency (%)
955
66.2%
132
 
9.1%
40
 
2.8%
28
 
1.9%
25
 
1.7%
22
 
1.5%
20
 
1.4%
20
 
1.4%
18
 
1.2%
17
 
1.2%
Other values (30)166
 
11.5%
Dingbats
ValueCountFrequency (%)
616
40.8%
281
18.6%
176
 
11.7%
146
 
9.7%
91
 
6.0%
58
 
3.8%
36
 
2.4%
21
 
1.4%
17
 
1.1%
16
 
1.1%
Other values (9)50
 
3.3%
Geometric Shapes Ext
ValueCountFrequency (%)
🟠513
55.8%
🟡257
27.9%
🟢57
 
6.2%
🟥39
 
4.2%
🟣39
 
4.2%
🟤7
 
0.8%
🟨4
 
0.4%
🟩4
 
0.4%
Punctuation
ValueCountFrequency (%)
399
26.3%
230
15.2%
214
14.1%
168
11.1%
151
 
9.9%
87
 
5.7%
69
 
4.5%
63
 
4.2%
59
 
3.9%
45
 
3.0%
Other values (6)33
 
2.2%
Geometric Shapes
ValueCountFrequency (%)
285
55.0%
208
40.2%
22
 
4.2%
3
 
0.6%
Diacriticals
ValueCountFrequency (%)
́266
89.9%
̈21
 
7.1%
̀6
 
2.0%
̂3
 
1.0%
Tags
ValueCountFrequency (%)
󠁧162
32.9%
󠁿82
16.7%
󠁢82
16.7%
󠁥80
16.3%
󠁮80
16.3%
󠁴2
 
0.4%
󠁣2
 
0.4%
󠁳2
 
0.4%
Emoticons
ValueCountFrequency (%)
😍144
27.1%
😂71
13.3%
😋42
 
7.9%
😷31
 
5.8%
😲30
 
5.6%
🙄26
 
4.9%
😮25
 
4.7%
😘19
 
3.6%
😳16
 
3.0%
😱14
 
2.6%
Other values (24)114
21.4%
Misc Technical
ValueCountFrequency (%)
28
53.8%
11
 
21.2%
9
 
17.3%
3
 
5.8%
1
 
1.9%
Specials
ValueCountFrequency (%)
5
100.0%
Currency Symbols
ValueCountFrequency (%)
2
100.0%

time
Date

Distinct15722
Distinct (%)91.9%
Missing0
Missing (%)0.0%
Memory size267.3 KiB
Minimum2019-09-25 12:41:12
Maximum2021-11-27 09:06:11
Histogram with fixed size bins (bins=50)

timestamp
Date

MISSING

Distinct7122
Distinct (%)93.2%
Missing9467
Missing (%)55.3%
Memory size267.3 KiB
Minimum2019-09-25 11:41:12
Maximum2021-11-22 19:16:40
Histogram with fixed size bins (bins=50)

likes
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
MISSING
SKEWED

Distinct1464
Distinct (%)9.9%
Missing2333
Missing (%)13.6%
Infinite0
Infinite (%)0.0%
Mean1433.692365
Minimum0
Maximum577803
Zeros3
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size267.3 KiB

Quantile statistics

Minimum0
5-th percentile10
Q131
median81
Q3297
95-th percentile5649
Maximum577803
Range577803
Interquartile range (IQR)266

Descriptive statistics

Standard deviation9433.885096
Coefficient of variation (CV)6.580132061
Kurtosis2036.616141
Mean1433.692365
Median Absolute Deviation (MAD)64
Skewness37.44574063
Sum21181371
Variance88998188.01
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
15185
 
1.1%
11174
 
1.0%
17174
 
1.0%
21163
 
1.0%
9161
 
0.9%
13158
 
0.9%
18157
 
0.9%
10147
 
0.9%
8146
 
0.9%
12145
 
0.8%
Other values (1454)13164
77.0%
(Missing)2333
 
13.6%
ValueCountFrequency (%)
03
 
< 0.1%
17
 
< 0.1%
222
 
0.1%
352
 
0.3%
442
 
0.2%
585
0.5%
678
0.5%
7113
0.7%
8146
0.9%
9161
0.9%
ValueCountFrequency (%)
5778031
< 0.1%
5743901
< 0.1%
3384021
< 0.1%
2334161
< 0.1%
2287961
< 0.1%
1557101
< 0.1%
1509621
< 0.1%
1409401
< 0.1%
769801
< 0.1%
760001
< 0.1%

comments
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
SKEWED
ZEROS

Distinct769
Distinct (%)4.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean170.2038931
Minimum0
Maximum182227
Zeros1797
Zeros (%)10.5%
Negative0
Negative (%)0.0%
Memory size267.3 KiB

Quantile statistics

Minimum0
5-th percentile0
Q12
median8
Q333
95-th percentile368.7
Maximum182227
Range182227
Interquartile range (IQR)31

Descriptive statistics

Standard deviation2153.172056
Coefficient of variation (CV)12.65054528
Kurtosis5980.791421
Mean170.2038931
Median Absolute Deviation (MAD)7
Skewness71.48847167
Sum2911678
Variance4636149.904
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
01797
 
10.5%
11608
 
9.4%
21200
 
7.0%
31062
 
6.2%
4780
 
4.6%
5642
 
3.8%
6611
 
3.6%
7451
 
2.6%
8432
 
2.5%
9382
 
2.2%
Other values (759)8142
47.6%
ValueCountFrequency (%)
01797
10.5%
11608
9.4%
21200
7.0%
31062
6.2%
4780
4.6%
5642
 
3.8%
6611
 
3.6%
7451
 
2.6%
8432
 
2.5%
9382
 
2.2%
ValueCountFrequency (%)
1822271
 
< 0.1%
1822261
 
< 0.1%
323191
 
< 0.1%
147153
< 0.1%
142821
 
< 0.1%
142801
 
< 0.1%
142791
 
< 0.1%
142781
 
< 0.1%
142761
 
< 0.1%
142721
 
< 0.1%

shares
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
SKEWED
ZEROS

Distinct670
Distinct (%)3.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean495.522067
Minimum0
Maximum549145
Zeros2508
Zeros (%)14.7%
Negative0
Negative (%)0.0%
Memory size267.3 KiB

Quantile statistics

Minimum0
5-th percentile0
Q11
median4
Q317
95-th percentile321.4
Maximum549145
Range549145
Interquartile range (IQR)16

Descriptive statistics

Standard deviation6283.69629
Coefficient of variation (CV)12.68096157
Kurtosis3615.181094
Mean495.522067
Median Absolute Deviation (MAD)4
Skewness48.03356063
Sum8476896
Variance39484839.07
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
02508
14.7%
12491
14.6%
21659
 
9.7%
31218
 
7.1%
4951
 
5.6%
5699
 
4.1%
6532
 
3.1%
7489
 
2.9%
8408
 
2.4%
9340
 
2.0%
Other values (660)5812
34.0%
ValueCountFrequency (%)
02508
14.7%
12491
14.6%
21659
9.7%
31218
7.1%
4951
 
5.6%
5699
 
4.1%
6532
 
3.1%
7489
 
2.9%
8408
 
2.4%
9340
 
2.0%
ValueCountFrequency (%)
5491451
 
< 0.1%
1818051
 
< 0.1%
1818024
 
< 0.1%
820911
 
< 0.1%
649163
 
< 0.1%
602763
 
< 0.1%
602752
 
< 0.1%
6027411
0.1%
597741
 
< 0.1%
490205
< 0.1%

username
Categorical

HIGH CORRELATION

Distinct5
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size267.3 KiB
Mosaïque FM
4461 
Jawhara FM
4401 
Radio Med Tunisie
3393 
Nessma
3045 
Shems FM (page officielle)
1807 

Length

Max length26
Median length11
Mean length12.62722862
Min length6

Characters and Unicode

Total characters216014
Distinct characters29
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowRadio Med Tunisie
2nd rowRadio Med Tunisie
3rd rowRadio Med Tunisie
4th rowRadio Med Tunisie
5th rowRadio Med Tunisie

Common Values

ValueCountFrequency (%)
Mosaïque FM4461
26.1%
Jawhara FM4401
25.7%
Radio Med Tunisie3393
19.8%
Nessma3045
17.8%
Shems FM (page officielle)1807
10.6%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
fm10669
27.9%
mosaïque4461
11.7%
jawhara4401
11.5%
tunisie3393
 
8.9%
med3393
 
8.9%
radio3393
 
8.9%
nessma3045
 
8.0%
officielle1807
 
4.7%
page1807
 
4.7%
shems1807
 
4.7%

Most occurring characters

ValueCountFrequency (%)
a25909
12.0%
e21520
 
10.0%
21069
 
9.8%
M18523
 
8.6%
s15751
 
7.3%
i13793
 
6.4%
F10669
 
4.9%
o9661
 
4.5%
u7854
 
3.6%
d6786
 
3.1%
Other values (19)64479
29.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter146100
67.6%
Uppercase Letter45231
 
20.9%
Space Separator21069
 
9.8%
Open Punctuation1807
 
0.8%
Close Punctuation1807
 
0.8%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a25909
17.7%
e21520
14.7%
s15751
10.8%
i13793
9.4%
o9661
 
6.6%
u7854
 
5.4%
d6786
 
4.6%
h6208
 
4.2%
m4852
 
3.3%
q4461
 
3.1%
Other values (9)29305
20.1%
Uppercase Letter
ValueCountFrequency (%)
M18523
41.0%
F10669
23.6%
J4401
 
9.7%
R3393
 
7.5%
T3393
 
7.5%
N3045
 
6.7%
S1807
 
4.0%
Space Separator
ValueCountFrequency (%)
21069
100.0%
Open Punctuation
ValueCountFrequency (%)
(1807
100.0%
Close Punctuation
ValueCountFrequency (%)
)1807
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin191331
88.6%
Common24683
 
11.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
a25909
13.5%
e21520
 
11.2%
M18523
 
9.7%
s15751
 
8.2%
i13793
 
7.2%
F10669
 
5.6%
o9661
 
5.0%
u7854
 
4.1%
d6786
 
3.5%
h6208
 
3.2%
Other values (16)54657
28.6%
Common
ValueCountFrequency (%)
21069
85.4%
(1807
 
7.3%
)1807
 
7.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII211553
97.9%
None4461
 
2.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a25909
12.2%
e21520
 
10.2%
21069
 
10.0%
M18523
 
8.8%
s15751
 
7.4%
i13793
 
6.5%
F10669
 
5.0%
o9661
 
4.6%
u7854
 
3.7%
d6786
 
3.2%
Other values (18)60018
28.4%
None
ValueCountFrequency (%)
ï4461
100.0%

is_live
Boolean

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size150.4 KiB
False
17106 
True
 
1
ValueCountFrequency (%)
False17106
> 99.9%
True1
 
< 0.1%

reaction_count
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
MISSING

Distinct1897
Distinct (%)27.6%
Missing10243
Missing (%)59.9%
Infinite0
Infinite (%)0.0%
Mean37205.272
Minimum2
Maximum3962765
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size267.3 KiB

Quantile statistics

Minimum2
5-th percentile25
Q1128.75
median2527
Q330837
95-th percentile280400.7
Maximum3962765
Range3962763
Interquartile range (IQR)30708.25

Descriptive statistics

Standard deviation145766.4402
Coefficient of variation (CV)3.917897447
Kurtosis251.8596765
Mean37205.272
Median Absolute Deviation (MAD)2492
Skewness13.13364923
Sum255376987
Variance2.124785508 × 1010
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
139281
 
0.5%
3124680
 
0.5%
3102679
 
0.5%
252777
 
0.5%
5943476
 
0.4%
528456
 
0.3%
372351
 
0.3%
252649
 
0.3%
404148
 
0.3%
5944546
 
0.3%
Other values (1887)6221
36.4%
(Missing)10243
59.9%
ValueCountFrequency (%)
23
 
< 0.1%
38
 
< 0.1%
48
 
< 0.1%
510
0.1%
613
0.1%
715
0.1%
810
0.1%
917
0.1%
1015
0.1%
1121
0.1%
ValueCountFrequency (%)
39627652
< 0.1%
26137471
< 0.1%
26137431
< 0.1%
26137421
< 0.1%
26137401
< 0.1%
24155941
< 0.1%
24155931
< 0.1%
17237121
< 0.1%
16736581
< 0.1%
16736321
< 0.1%

Like
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct1678
Distinct (%)9.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11129.06933
Minimum0
Maximum2949058
Zeros10243
Zeros (%)59.9%
Negative0
Negative (%)0.0%
Memory size267.3 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q3297
95-th percentile43507
Maximum2949058
Range2949058
Interquartile range (IQR)297

Descriptive statistics

Standard deviation72451.71584
Coefficient of variation (CV)6.510132492
Kurtosis589.2107156
Mean11129.06933
Median Absolute Deviation (MAD)0
Skewness19.68747014
Sum190384989
Variance5249251128
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
010243
59.9%
201585
 
0.5%
625184
 
0.5%
88481
 
0.5%
632780
 
0.5%
184877
 
0.5%
5073176
 
0.4%
4350771
 
0.4%
1217661
 
0.4%
201157
 
0.3%
Other values (1668)6192
36.2%
ValueCountFrequency (%)
010243
59.9%
11
 
< 0.1%
24
 
< 0.1%
311
 
0.1%
410
 
0.1%
515
 
0.1%
620
 
0.1%
78
 
< 0.1%
827
 
0.2%
932
 
0.2%
ValueCountFrequency (%)
29490582
< 0.1%
21731341
< 0.1%
21731301
< 0.1%
21731292
< 0.1%
18983061
< 0.1%
18983051
< 0.1%
13262911
< 0.1%
11306331
< 0.1%
11150431
< 0.1%
11150251
< 0.1%

Love
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
SKEWED
ZEROS

Distinct604
Distinct (%)3.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2148.585024
Minimum0
Maximum871171
Zeros10979
Zeros (%)64.2%
Negative0
Negative (%)0.0%
Memory size267.3 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q310
95-th percentile8012
Maximum871171
Range871171
Interquartile range (IQR)10

Descriptive statistics

Standard deviation19078.48777
Coefficient of variation (CV)8.879559131
Kurtosis805.5897661
Mean2148.585024
Median Absolute Deviation (MAD)0
Skewness25.25020846
Sum36755844
Variance363988695.4
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
010979
64.2%
1585
 
3.4%
2385
 
2.3%
3249
 
1.5%
4179
 
1.0%
5135
 
0.8%
599126
 
0.7%
84111
 
0.6%
6110
 
0.6%
180108
 
0.6%
Other values (594)4140
 
24.2%
ValueCountFrequency (%)
010979
64.2%
1585
 
3.4%
2385
 
2.3%
3249
 
1.5%
4179
 
1.0%
5135
 
0.8%
6110
 
0.6%
760
 
0.4%
873
 
0.4%
955
 
0.3%
ValueCountFrequency (%)
8711712
 
< 0.1%
5167931
 
< 0.1%
5167852
 
< 0.1%
4969481
 
< 0.1%
3957682
 
< 0.1%
3749747
< 0.1%
3749733
< 0.1%
3749726
< 0.1%
3528493
< 0.1%
3528481
 
< 0.1%

Wow
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct290
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean53.32898813
Minimum0
Maximum14423
Zeros12182
Zeros (%)71.2%
Negative0
Negative (%)0.0%
Memory size267.3 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q32
95-th percentile266
Maximum14423
Range14423
Interquartile range (IQR)2

Descriptive statistics

Standard deviation347.9328188
Coefficient of variation (CV)6.524271901
Kurtosis570.4387694
Mean53.32898813
Median Absolute Deviation (MAD)0
Skewness19.77182667
Sum912299
Variance121057.2464
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
012182
71.2%
1489
 
2.9%
3375
 
2.2%
2289
 
1.7%
41213
 
1.2%
14181
 
1.1%
26155
 
0.9%
20144
 
0.8%
22143
 
0.8%
15124
 
0.7%
Other values (280)2812
 
16.4%
ValueCountFrequency (%)
012182
71.2%
1489
 
2.9%
2289
 
1.7%
3375
 
2.2%
469
 
0.4%
576
 
0.4%
651
 
0.3%
742
 
0.2%
847
 
0.3%
931
 
0.2%
ValueCountFrequency (%)
144231
 
< 0.1%
139041
 
< 0.1%
102071
 
< 0.1%
101661
 
< 0.1%
100132
< 0.1%
80532
< 0.1%
70681
 
< 0.1%
59833
< 0.1%
55014
< 0.1%
49461
 
< 0.1%

Sad
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct279
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean132.9562752
Minimum0
Maximum46907
Zeros12328
Zeros (%)72.1%
Negative0
Negative (%)0.0%
Memory size267.3 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile104
Maximum46907
Range46907
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1922.329296
Coefficient of variation (CV)14.45835703
Kurtosis379.7975557
Mean132.9562752
Median Absolute Deviation (MAD)0
Skewness19.1315035
Sum2274483
Variance3695349.922
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
012328
72.1%
1643
 
3.8%
7262
 
1.5%
12254
 
1.5%
4235
 
1.4%
11219
 
1.3%
115203
 
1.2%
24196
 
1.1%
9194
 
1.1%
2155
 
0.9%
Other values (269)2418
 
14.1%
ValueCountFrequency (%)
012328
72.1%
1643
 
3.8%
2155
 
0.9%
3155
 
0.9%
4235
 
1.4%
531
 
0.2%
648
 
0.3%
7262
 
1.5%
820
 
0.1%
9194
 
1.1%
ValueCountFrequency (%)
469071
 
< 0.1%
395009
0.1%
394993
 
< 0.1%
384482
 
< 0.1%
384461
 
< 0.1%
384431
 
< 0.1%
384353
 
< 0.1%
384333
 
< 0.1%
384213
 
< 0.1%
384112
 
< 0.1%

Angry
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
SKEWED
ZEROS

Distinct231
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean69.57187116
Minimum0
Maximum31304
Zeros12232
Zeros (%)71.5%
Negative0
Negative (%)0.0%
Memory size267.3 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile99
Maximum31304
Range31304
Interquartile range (IQR)1

Descriptive statistics

Standard deviation952.9869043
Coefficient of variation (CV)13.69787658
Kurtosis437.9590689
Mean69.57187116
Median Absolute Deviation (MAD)0
Skewness20.46354447
Sum1190166
Variance908184.0398
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
012232
71.5%
1613
 
3.6%
28378
 
2.2%
2360
 
2.1%
12347
 
2.0%
41229
 
1.3%
5212
 
1.2%
6207
 
1.2%
355184
 
1.1%
3153
 
0.9%
Other values (221)2192
 
12.8%
ValueCountFrequency (%)
012232
71.5%
1613
 
3.6%
2360
 
2.1%
3153
 
0.9%
4103
 
0.6%
5212
 
1.2%
6207
 
1.2%
727
 
0.2%
825
 
0.1%
9112
 
0.7%
ValueCountFrequency (%)
313041
 
< 0.1%
200548
< 0.1%
200534
< 0.1%
193702
 
< 0.1%
193661
 
< 0.1%
193651
 
< 0.1%
193632
 
< 0.1%
193624
< 0.1%
193603
 
< 0.1%
193572
 
< 0.1%

Care
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
SKEWED
ZEROS

Distinct243
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean235.5775414
Minimum0
Maximum90180
Zeros12435
Zeros (%)72.7%
Negative0
Negative (%)0.0%
Memory size267.3 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile600
Maximum90180
Range90180
Interquartile range (IQR)1

Descriptive statistics

Standard deviation3089.142943
Coefficient of variation (CV)13.11306216
Kurtosis686.9447958
Mean235.5775414
Median Absolute Deviation (MAD)0
Skewness25.43157297
Sum4030025
Variance9542804.123
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
012435
72.7%
1508
 
3.0%
12245
 
1.4%
2187
 
1.1%
32176
 
1.0%
543167
 
1.0%
77154
 
0.9%
93151
 
0.9%
150130
 
0.8%
28127
 
0.7%
Other values (233)2827
 
16.5%
ValueCountFrequency (%)
012435
72.7%
1508
 
3.0%
2187
 
1.1%
3125
 
0.7%
459
 
0.3%
547
 
0.3%
632
 
0.2%
715
 
0.1%
812
 
0.1%
912
 
0.1%
ValueCountFrequency (%)
901802
 
< 0.1%
8671616
0.1%
549783
 
< 0.1%
549771
 
< 0.1%
435871
 
< 0.1%
400053
 
< 0.1%
264062
 
< 0.1%
205721
 
< 0.1%
169121
 
< 0.1%
156373
 
< 0.1%

Weekday
Categorical

HIGH CORRELATION

Distinct7
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size267.3 KiB
Wednesday
3121 
Friday
2904 
Tuesday
2358 
Saturday
2355 
Thursday
2303 
Other values (2)
4066 

Length

Max length9
Median length7
Mean length7.22973052
Min length6

Characters and Unicode

Total characters123679
Distinct characters17
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowMonday
2nd rowMonday
3rd rowMonday
4th rowMonday
5th rowMonday

Common Values

ValueCountFrequency (%)
Wednesday3121
18.2%
Friday2904
17.0%
Tuesday2358
13.8%
Saturday2355
13.8%
Thursday2303
13.5%
Monday2242
13.1%
Sunday1824
10.7%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
wednesday3121
18.2%
friday2904
17.0%
tuesday2358
13.8%
saturday2355
13.8%
thursday2303
13.5%
monday2242
13.1%
sunday1824
10.7%

Most occurring characters

ValueCountFrequency (%)
d20228
16.4%
a19462
15.7%
y17107
13.8%
u8840
7.1%
e8600
7.0%
s7782
 
6.3%
r7562
 
6.1%
n7187
 
5.8%
T4661
 
3.8%
S4179
 
3.4%
Other values (7)18071
14.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter106572
86.2%
Uppercase Letter17107
 
13.8%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
d20228
19.0%
a19462
18.3%
y17107
16.1%
u8840
8.3%
e8600
8.1%
s7782
 
7.3%
r7562
 
7.1%
n7187
 
6.7%
i2904
 
2.7%
t2355
 
2.2%
Other values (2)4545
 
4.3%
Uppercase Letter
ValueCountFrequency (%)
T4661
27.2%
S4179
24.4%
W3121
18.2%
F2904
17.0%
M2242
13.1%

Most occurring scripts

ValueCountFrequency (%)
Latin123679
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
d20228
16.4%
a19462
15.7%
y17107
13.8%
u8840
7.1%
e8600
7.0%
s7782
 
6.3%
r7562
 
6.1%
n7187
 
5.8%
T4661
 
3.8%
S4179
 
3.4%
Other values (7)18071
14.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII123679
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
d20228
16.4%
a19462
15.7%
y17107
13.8%
u8840
7.1%
e8600
7.0%
s7782
 
6.3%
r7562
 
6.1%
n7187
 
5.8%
T4661
 
3.8%
S4179
 
3.4%
Other values (7)18071
14.6%

Day
Real number (ℝ≥0)

HIGH CORRELATION

Distinct31
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20.32121354
Minimum1
Maximum31
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size267.3 KiB

Quantile statistics

Minimum1
5-th percentile9
Q117
median21
Q325
95-th percentile29
Maximum31
Range30
Interquartile range (IQR)8

Descriptive statistics

Standard deviation5.970297165
Coefficient of variation (CV)0.293796291
Kurtosis0.3620482385
Mean20.32121354
Median Absolute Deviation (MAD)4
Skewness-0.6908379415
Sum347635
Variance35.64444824
MonotonicityNot monotonic
Histogram with fixed size bins (bins=31)
ValueCountFrequency (%)
241591
 
9.3%
191278
 
7.5%
181104
 
6.5%
231097
 
6.4%
211056
 
6.2%
201038
 
6.1%
27974
 
5.7%
26924
 
5.4%
22915
 
5.3%
17892
 
5.2%
Other values (21)6238
36.5%
ValueCountFrequency (%)
178
 
0.5%
265
 
0.4%
372
 
0.4%
486
 
0.5%
5105
 
0.6%
685
 
0.5%
7108
 
0.6%
8137
0.8%
9179
1.0%
10292
1.7%
ValueCountFrequency (%)
31223
 
1.3%
30236
 
1.4%
29550
 
3.2%
28625
 
3.7%
27974
5.7%
26924
5.4%
25749
4.4%
241591
9.3%
231097
6.4%
22915
5.3%

Month
Real number (ℝ≥0)

Distinct9
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8.78236979
Minimum1
Maximum12
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size267.3 KiB

Quantile statistics

Minimum1
5-th percentile3
Q16
median9
Q311
95-th percentile12
Maximum12
Range11
Interquartile range (IQR)5

Descriptive statistics

Standard deviation3.07276388
Coefficient of variation (CV)0.3498786721
Kurtosis-0.6896177977
Mean8.78236979
Median Absolute Deviation (MAD)2
Skewness-0.8162594113
Sum150240
Variance9.441877863
MonotonicityNot monotonic
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
115332
31.2%
93820
22.3%
122825
16.5%
32527
14.8%
62464
14.4%
1057
 
0.3%
131
 
0.2%
829
 
0.2%
522
 
0.1%
ValueCountFrequency (%)
131
 
0.2%
32527
14.8%
522
 
0.1%
62464
14.4%
829
 
0.2%
93820
22.3%
1057
 
0.3%
115332
31.2%
122825
16.5%
ValueCountFrequency (%)
122825
16.5%
115332
31.2%
1057
 
0.3%
93820
22.3%
829
 
0.2%
62464
14.4%
522
 
0.1%
32527
14.8%
131
 
0.2%

Minute
Real number (ℝ≥0)

ZEROS

Distinct60
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean25.95627521
Minimum0
Maximum59
Zeros875
Zeros (%)5.1%
Negative0
Negative (%)0.0%
Memory size267.3 KiB

Quantile statistics

Minimum0
5-th percentile0
Q111
median25
Q342
95-th percentile55
Maximum59
Range59
Interquartile range (IQR)31

Descriptive statistics

Standard deviation17.67783205
Coefficient of variation (CV)0.6810619749
Kurtosis-1.223287141
Mean25.95627521
Median Absolute Deviation (MAD)15
Skewness0.1448405279
Sum444034
Variance312.5057462
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0875
 
5.1%
1701
 
4.1%
15438
 
2.6%
2422
 
2.5%
30403
 
2.4%
11386
 
2.3%
14365
 
2.1%
13364
 
2.1%
45349
 
2.0%
31328
 
1.9%
Other values (50)12476
72.9%
ValueCountFrequency (%)
0875
5.1%
1701
4.1%
2422
2.5%
3321
 
1.9%
4305
 
1.8%
5295
 
1.7%
6237
 
1.4%
7220
 
1.3%
8225
 
1.3%
9268
 
1.6%
ValueCountFrequency (%)
59213
1.2%
58137
0.8%
57111
 
0.6%
56169
1.0%
55233
1.4%
54184
1.1%
53222
1.3%
52248
1.4%
51269
1.6%
50308
1.8%

Hour
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct24
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13.19933361
Minimum0
Maximum23
Zeros181
Zeros (%)1.1%
Negative0
Negative (%)0.0%
Memory size267.3 KiB

Quantile statistics

Minimum0
5-th percentile6
Q110
median13
Q317
95-th percentile21
Maximum23
Range23
Interquartile range (IQR)7

Descriptive statistics

Standard deviation5.009498008
Coefficient of variation (CV)0.379526585
Kurtosis-0.4741898782
Mean13.19933361
Median Absolute Deviation (MAD)4
Skewness-0.1201498407
Sum225801
Variance25.09507029
MonotonicityNot monotonic
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
121465
 
8.6%
111346
 
7.9%
101153
 
6.7%
171128
 
6.6%
71105
 
6.5%
81078
 
6.3%
131072
 
6.3%
161043
 
6.1%
181040
 
6.1%
15944
 
5.5%
Other values (14)5733
33.5%
ValueCountFrequency (%)
0181
 
1.1%
1127
 
0.7%
2139
 
0.8%
374
 
0.4%
456
 
0.3%
5116
 
0.7%
6412
 
2.4%
71105
6.5%
81078
6.3%
9910
5.3%
ValueCountFrequency (%)
23314
 
1.8%
22474
2.8%
21551
3.2%
20661
3.9%
19852
5.0%
181040
6.1%
171128
6.6%
161043
6.1%
15944
5.5%
14866
5.1%

Year
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size267.3 KiB
2021
10576 
2020
5539 
2019
 
992

Length

Max length4
Median length4
Mean length4
Min length4

Characters and Unicode

Total characters68428
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021
2nd row2021
3rd row2021
4th row2021
5th row2021

Common Values

ValueCountFrequency (%)
202110576
61.8%
20205539
32.4%
2019992
 
5.8%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
202110576
61.8%
20205539
32.4%
2019992
 
5.8%

Most occurring characters

ValueCountFrequency (%)
233222
48.6%
022646
33.1%
111568
 
16.9%
9992
 
1.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number68428
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
233222
48.6%
022646
33.1%
111568
 
16.9%
9992
 
1.4%

Most occurring scripts

ValueCountFrequency (%)
Common68428
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
233222
48.6%
022646
33.1%
111568
 
16.9%
9992
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII68428
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
233222
48.6%
022646
33.1%
111568
 
16.9%
9992
 
1.4%

Interactions

Correlations

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
The dendrogram allows you to more fully correlate variable completion, revealing trends deeper than the pairwise ones visible in the correlation heatmap.

Sample

First rows

post_idpost_texttimetimestamplikescommentssharesusernameis_livereaction_countLikeLoveWowSadAngryCareWeekdayDayMonthMinuteHourYear
0979693202944224⚽️ 🎁 1200 دينار كاش للربح مع Tunisiabet.net بعد ايقاف مقابلة #ليون و #مارسيليا ، التفاصيل في الفيديو 👇👇\n\nTunisiabet.net\n#Stade_Med by #Radio_Med\n#tunisiaBet2021-11-22 14:58:112021-11-22 14:08:123.011Radio Med TunisieFalseNaN0.00.00.00.00.00.0Monday221158142021
15243859242303198اخماد حريق بنزل بالحمامات الشمالية2021-11-22 19:11:202021-11-22 18:11:2012.000Radio Med TunisieFalseNaN0.00.00.00.00.00.0Monday221111192021
25243473002341822يتجدد موعدكم مع #مروى_المعموري من الاثنين للجمعة في #Be_Cool من ال-19 🕖 حتى ال-21🕘\n👈 تبعونا تلقاو كل ما يهمكم مع برشا موجات ايجابية على #راديو_ماد\n\n-----------------\nSuivez-nous\n🎙 | Live : https://radiomedtunisie.com/LIVE\n📸 | Instagram : https://www.instagram.com/radiomedtn/\n📱 | Facebook : https://www.facebook.com/RadioMedTunisie2021-11-22 18:55:012021-11-22 17:55:012.000Radio Med TunisieFalseNaN0.00.00.00.00.00.0Monday221155182021
35243801252308997تونس تستضيف فعالية “صنع في ليبيا” لدعم شراكة البلدين2021-11-22 18:46:222021-11-22 17:46:225.001Radio Med TunisieFalseNaN0.00.00.00.00.00.0Monday221146182021
4396392812167201#L'affiche :\nنقترحو عليكم سلسلة تاريخية من أنجح الأعمال الكورية الجنوبية (المملكة)2021-11-22 18:23:232021-11-22 17:30:166.002Radio Med TunisieFalseNaN0.00.00.00.00.00.0Monday221123182021
55243740928981696الليلة: أمطار متفرقة بأقصى الشمال2021-11-22 18:21:032021-11-22 17:21:0321.000Radio Med TunisieFalseNaN0.00.00.00.00.00.0Monday221121182021
6498523994595303🔴مشاركة تونسية متميزة في معرض صنع في ليبيا\n📞محمد داود رئيس الغرفة الفتية الاقتصادية التونسية2021-11-22 18:12:062021-11-22 17:14:3017.002Radio Med TunisieFalseNaN0.00.00.00.00.00.0Monday221112182021
7427256082311864* "ماسنجر" و"انستجرام" قد لا يحصلان على التشفير الافتراضى حتى 2023.\n* تحديث كروم 96 يتسبب في مشاكل مع تويتر وانستجرام.. اعرف التفاصيل .2021-11-22 17:50:192021-11-22 16:51:396.000Radio Med TunisieFalseNaN0.00.00.00.00.00.0Monday221150172021
85243665428989246وزيرة الصناعة تبحث مع نظيرها الليبي سبل تحقيق الشراكة الاقتصادية بين البلدين2021-11-22 17:51:352021-11-22 16:51:3534.002Radio Med TunisieFalseNaN0.00.00.00.00.00.0Monday221151172021
95243620012327121عادل الدعداع يقدم استقالته من رئاسة نادي حمام الأنف2021-11-22 17:35:322021-11-22 16:35:3214.021Radio Med TunisieFalseNaN0.00.00.00.00.00.0Monday221135172021

Last rows

post_idpost_texttimetimestamplikescommentssharesusernameis_livereaction_countLikeLoveWowSadAngryCareWeekdayDayMonthMinuteHourYear
170972165089750288283🔶موضوع علاش و كيفاش: سلوك التونسي أثناء السياقة و مدى احترامو لقواعد الطرقات\n🔸ضيفتنا حنان التميمي عضو المجلس التنفيذي و المكلفة بالشؤون القانونية للجمعية التونسية للوقاية من حوادث الطرقات\n#ShemsFm #AlechOuKifech2021-11-27 01:14:58NaT30.088Shems FM (page officielle)FalseNaN0.00.00.00.00.00.0Saturday27111412021
17098439125243863227🔴أبناء شمس أف أم بصوت واحد: لن نركع.. لن نتراجع2021-11-27 02:15:03NaT124.02931Shems FM (page officielle)FalseNaN0.00.00.00.00.00.0Saturday27111522021
17099931728810897199🔶أخبار الرياضة في الكرونوسبور⚽\n#ShemsFm #ChronoSport2021-11-27 02:15:09NaT20.041Shems FM (page officielle)FalseNaN0.00.00.00.00.00.0Saturday27111522021
171001082514098884808🟠رئيس الحكومة يذكر بالنجاحات الأمنية التي تحققت منذ هجوم باردو\n🔸البنك الدولي يجدّد التزامه بمواصلة دعم تونس ومرافقتها في هذه المرحلة الحسّاسة\n🔸أعوان مجلس النواب ينفّذون وقفة احتجاجية تنديدا ب اعتداءات عبير موسي\n🎙#NewsShemsFm 📻 🇹🇳2021-11-27 02:15:14NaT25.091Shems FM (page officielle)FalseNaN0.00.00.00.00.00.0Saturday27111522021
17101486210715722958🟠الأخبار\n🎙#NewsShemsFm 📻 🇹🇳2021-11-27 03:15:19NaT38.040Shems FM (page officielle)FalseNaN0.00.00.00.00.00.0Saturday27111532021
17102796372897669457🔶الفوندو مع كريمة الشموسة ورياض الزاوش 🎻🎷 #ShemsFm #ElFoundou2021-11-27 04:15:23NaT24.053Shems FM (page officielle)FalseNaN0.00.00.00.00.00.0Saturday27111542021
17103898221760971598🔶آلو مدير : تونس تستعد لإطلاق أول قمر صناعي "تونسي تحدي 1"\n#ShemsFm #LaMatinale2021-11-27 06:15:27NaT124.05869Shems FM (page officielle)FalseNaN0.00.00.00.00.00.0Saturday27111562021
171041757112811155346🔶المهمة : وين وصل تحقيق ال48 ساعة الخاص بصب المياه المستعملة من onas في سد سيدي سالم ؟ شنوة صار ؟ و شنوة لقاو ؟ التفاصيل مع أسامة الشوالي\n#ShemsFm #LaMatinale #Almouhema2021-11-27 06:15:32NaT75.0222Shems FM (page officielle)FalseNaN0.00.00.00.00.00.0Saturday27111562021
171052775076899471023🔶بوغلاب ما يفلت شي : شيلني وأشيلك\n#ShemsFm #LaMatinale #BoughallebMayfaletChay2021-11-27 06:15:36NaT138.06628Shems FM (page officielle)FalseNaN0.00.00.00.00.00.0Saturday27111562021
171063717591268355743🔶Shems Clean\n#ShemsFm #LaMatinale\n\n🔶Shems propre\n#ShemsFm #LaMatinale2021-11-27 06:15:41NaT15.031Shems FM (page officielle)FalseNaN0.00.00.00.00.00.0Saturday27111562021

Duplicate rows

Most frequently occurring

post_idpost_texttimetimestamplikescommentssharesusernameis_livereaction_countLikeLoveWowSadAngryCareWeekdayDayMonthMinuteHourYear# duplicates
10710159218760787796إلغاء بطولة إفريقيا لكرة الطائرة '' سيدات '' في رواندا 🏆💥😮2021-09-18 17:19:372021-09-18 15:19:3721.000NessmaFalse21.021.00.00.00.00.00.0Saturday189191720215
10810159218764542796الرئيس الجزائري يعلن تنكيس العلم إثر وفاة بوتفليقة2021-09-18 17:20:152021-09-18 15:20:15142.0124NessmaFalse171.0142.00.00.026.00.01.0Saturday189201720215
11610159225234982796رياض الشعيبي: رئيس الجمهورية يحاول التحايل على الدستور بعدم إعلانه صراحة تعليق العمل به2021-09-21 23:00:062021-09-21 21:00:0664.0763NessmaFalse103.064.01.01.00.00.00.0Tuesday21902320214
5210158556950187796بيل غايتس : عام 2020 كان مدمرا و الأخبار الجيدة ستأتي مع 2021 !2020-12-23 23:30:502020-12-23 22:30:50169.0148NessmaFalse193.0169.016.00.01.00.00.0Wednesday2312302320203
15910159342410272796عزيزي برج العقرب : يتحدث هذا اليوم عن فرصة للربح أو لإطلاق مشروع كبير تعلق عليه آمالاً كبيرة 😍😘\nأيّا كل واحد يدخل يشوف حظو اليوم شنوة مخبيلو 👈 #نسمة #حظك_اليوم ♈♉♊\nwww.nessma.tv2021-11-16 09:00:012021-11-16 08:00:0179.033NessmaFalse94.079.014.00.00.00.00.0Tuesday16110920213
16610159343659557796الأسد يصدر مرسوما تشريعيا يلغي منصب مفتي الجمهورية2021-11-16 07:52:022021-11-16 06:52:02117.062NessmaFalse122.0117.01.00.00.03.00.0Tuesday161152720213
16710159343731992796الدبيبة: لا يمكن أن نتنازل عن الحق الانتخابي أبدا لكنهم خرجوا بقوانين مفصلة على بعض الأشخاص..2021-11-16 08:50:372021-11-16 07:50:3778.020NessmaFalse81.078.00.00.00.00.01.0Tuesday161150820213
168101593437563577962021-11-16 09:07:222021-11-16 08:07:2250.0202NessmaFalse53.050.00.00.00.02.00.0Tuesday16117920213
16910159343811202796دفع التعاون الاقتصادي الثنائي محور لقاء سمير ماجول بسفير فرنسا2021-11-16 09:50:562021-11-16 08:50:5667.063NessmaFalse71.067.00.00.03.00.00.0Tuesday161150920213
170101593438179577962021-11-16 09:56:332021-11-16 08:56:3319.012NessmaFalse20.019.00.00.00.00.00.0Tuesday161156920213